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Abstract. We announce the detection of correlations on angular scales of ;> 10' be- 
tween optically bright, high-redshift, radio-loud QSOs with diffuse X-ray emission seen 
by ROSAT in the All-Sky Survey. These correlations reach significance levels of up 
to 99.8%. A comparison of the results with a sample of control fields, bootstrapping 
analyses, and Kolmogorov-Smirnov tests provide unambiguous support for the statisti- 
cal significance of the correlations found. We argue that the detected enhanced diffuse 
X-ray emission is in the foreground of the QSOs, and that it is probably due to galaxy 
clusters which magnify the QSOs by their gravitational lensing effect, thereby giving rise 
to a magnification bias in the background source sample. A comparison of the results 
presented below with correlations previously found between the same QSO sample and 
either Lick or IRAS galaxies provides further evidence for this interpretation, and iden- 
tifies positions in the sky where weak gravitational lensing may be detected by searching 
for coherent distortions of background galaxy images. 
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1 Introduction 

Previous analyses of correlations between foreground galaxies and distant, optically 
bright, radio-loud QSOs on angular scales of ;> 10' (hereafter called large-scale cor- 
relations) have shown that these QSOs are correlated with galaxies selected by either 
infrared or optical emission (Bartelmann & Schneider 1993b, 1994; henceforth BS2,3). 
Trends in the results as well as their qualitative agreement with a previously performed 
theoretical investigation (Bartelmann & Schneider 1993a, henceforth BS1) provide evi- 
dence that these correlations are caused by the magnification bias due to weak lensing 
by extended inhomogeneities in the Universe, on scales of galaxy clusters or larger. 

Clusters, probably consisting mainly of dark matter, are usually X-ray bright objects, 
with luminosities in the range Lx € [10 43 . . . 10 45 ] ergs/s (e.g., Sarazin 1992 and references 
therein). Therefore, if clusters are responsible for the large-scale correlations in question, 
one should be able to see enhanced diffuse X-ray emission in the vicinity of bright and 
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distant, presumably weakly lensed background sources. This paper describes the results 
of a correlation analysis on the basis of data from the ROSAT All-Sky Survey (see, e.g., 
Voges 1992). 

We employ the rank-order correlation test described and applied in BS1,2,3, which 
has proven to be highly sensitive to detect weak correlations. The advantage of the 
rank-order correlation test is that it is a non-parametric, distribution-free test which is 
unaffected by linear transformations of the data sets tested (and therefore unaffected 
by normalization problems); its disadvantage is that it yields a 'binary' result, since it 
merely states whether there are significant correlations or not. 

The paper is organized in the following way. Sect. 2 briefly describes the selection 
of sources and of control fields. Sect. 3 gives the average X-ray photon number densities 
retrieved from the ROSAT All-Sky Survey in those fields. Sect. 4 presents the rank- 
order correlation results and a bootstrapping test of their reliability, and Sect. 5 describes 
Kolmogorov-Smirnov tests further supporting the results of Sect. 4. Sect. 6 provides an 
explanation of a surprising feature detected in the analysis of the control fields. In 
Sect. 7, we introduce further statistical tests with the focus on illustrating the rank-order 
correlation results. Sect. 8 is devoted to a search for such background sources which either 
are close to known X-ray bright galaxy clusters or which appear significantly correlated 
with X-ray emission and galaxies. Finally, Sect. 9 summarizes and discusses the results. 

2 Selection of sources and control fields 

We use the same background source sample as in BS2,3, namely the optically identified 
1-Jansky sources (Kiihr et al. 1981, Stickel 1992, Stickel & Kiihr 1993a,b) with redshifts 
above 0.5; see BS2,3 for a more detailed description of this sample. The sample contains 
246 sources, most of which are fiat-spectrum QSOs. The numbers and positions of X-ray 
photons in the ROSAT energy range E 1 e [0.1, 2.2] keV in fields of 2° x 2° size centered 
on these sources were obtained from the All-Sky Survey, together with an equal number 
of equally sized control fields with the same galactic latitude as the source fields, but 
with randomly chosen galactic longitudes. The control fields were chosen such that the 
contamination by galactic foreground emission should on average be equal in the source 
and in the control fields. 

3 Mean photon counts 

In order to avoid severe contamination of the results by anisotropic X-ray noise, we 
restrict the analysis to photons with energies > 0.75 keV. The anisotropic noise in the 
All-Sky Survey is primarily due to line emission of atmospheric oxygen at 0.54 keV. Since 
it is therefore mainly caused by the Earth's atmosphere, it depends on the attitude of 
the ROSAT telescope relative to the Earth and the Sun, resulting in a pattern of stripes 
irregularly covering the X-ray sky as seen in the All-Sky Survey. Above ~ 0.75 keV, these 
stripes dim and disappear such that the X-ray sky is basically unaffected by atmospheric 
(and therefore attitude-dependent) X-ray noise in the energy ranges which we consider 
here (Freyberg 1993). We distinguish between three different energy ranges of the X-ray 
photons, namely E 1 > 0.75 keV, E 1 > 1.0 keV, and E 1 > 1.5 keV (later on called 'soft', 
'medium', and 'hard band', respectively). 
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Fig. 1. Radial photon-count profiles 
obtained from stacks of source fields, com- 
pared with the theoretical point-spread 
function for the All-Sky Survey. The three 
curves are the theoretical PSF (solid line) 
and the radial profiles of the QSO X-ray 
emission for redshifts z £ [1.0, 1.5] and z £ 
[1.5, 2.0] (dotted and dashed curves, respec- 
tively); only photons with > 0.75 keV 
were selected. The background is subtracted 
and the curves are normalized to unity 
within 10'. The error bars on the dashed 
curve are Poisson errors 

On average, there is only weak emission in the source as well as in the control fields. 
The mean photon counts in source and control fields are statistically identical, irrespective 
of the energy band. We obtain an average photon density (n 7 ) ~ 0.19, 0.12, 0.05 (')~ 2 in 
the soft, medium, and hard regimes, respectively. 

The photon number density in a field depends, of course, also on the exposure time 
of this field. Therefore, it is enhanced in fields close to the ecliptic poles, which are 
covered by ROSAT once per orbit. This does, however, not affect the later correlation 
analysis since the rank-order test is insensitive to linear transformations of the data. 

To illustrate the point-spread function, we display in Fig.l the theoretical point- 
spread function for the All-Sky Survey (solid line; cf. Hasinger et al. 1992) and two 
radial photon-count profiles obtained from stacks of source fields for E 1 > 0.75 keV. 
These two curves are for QSOs within redshift bins z € [0.5,1.0] and z e [1.0,1.5] 
(dotted and dashed curves, respectively). The background is subtracted, and the curves 
are normalized to unity within 10'. The point-spread function is sufficiently narrow to 
ensure that outside r ^> 5'.6 1 there is only negligible contamination from the central 
sources themselves. Within the error bars, the radial source profiles agree well with 
the theoretical PSF, except perhaps for the innermost data point. However, the source 
profiles were obtained from pixelized image stacks, which may cause a misalignment of 
the order of one pixel (30") between pixel centers and the true source positions, affecting 
the height of the profile. 

4 Rank-order correlation results 

The rank-order correlation test is performed as described in BS1,2,3. Briefly, the method 
proceeds as follows. A field in the sky centered on the position of a (background) source 
is covered by a pattern of iV ce ii (we choose iV ce ii = 25) equal-area cells, which, where 
not stated otherwise, are chosen to be rings of area 100 square arcminutes concentric 
with the source position. To avoid contamination from the emission of the background 
sources themselves, we later cut out the central cell and restrict the correlation analysis 
to the A^ ell = 24 surrounding cells, covering a ring between radii 5'.6 and 28'.2. The 
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distances of these rings to the center are ranked in descending order, providing a set of 
distance ranks {TZ^}- Then, the 'events' (galaxies in BS1,2,3 or X-ray photons in this 
paper) in these cells are counted and assigned count ranks 1Z C in ascending order. The 
two sets of ranks per field, {TZd} and {7l c }, define a hierarchy of 'closeness' to the sources 
and of counts, respectively. These two hierarchies are compared, and the comparison is 
quantified by a correlation coefficient r corr G [—1,1]. The statistical properties of r corr 
are known, and therefore r corr can be uniquely translated into an error level e£ [0,1], e is 
the error probability for the correlation hypothesis, or, conversely, (1 — e) is the statistical 
significance of the correlation result. For further details on this correlation technique, 
the reader is referred to BS1,2,3 or Kendall & Stuart (1973). 

The rank-order correlation test can be performed either with individual sources, 
yielding one correlation coefficient, and thus one error level per source, or with the 
averaged hierarchies of a source subsample, yielding the correlation error level of the 
whole subsample. In Fig. 2, we show examples of source and control fields which were 
selected for either low or high correlation coefficients. As in BS2,3, we define source 
subsamples by three parameters, namely an optical (visual) magnitude threshold m max , 
controlling the optical brightness limit of the subsample, and the bounds of a redshift 
interval [z m i n ,z max ], controlling the average distance of the subsample. Where z max is 
not given, it is set to infinity. 

Subsamples can equally be defined for the control fields, since there is one control 
field per source. We assign each control field the redshift and the optical magnitude of 
the corresponding source. In this case, the subsample parameters have the sole effect of 
reducing the field number by the same amount as in the source subsamples. 

To estimate the error Ae of e obtained from either source or control fields, we 
perform a bootstrapping analysis. This is done by randomly selecting from the N sources 
or control fields contained in the subsample N fields, but allowing fields to repeatedly 
enter the bootstrapped sample (see, e.g., Press et al. 1992 and references therein). This 
yields a (cumulative) distribution P(e) of error levels per subsample, with P(100%) = 1, 
P(0%) = 0. We call the median e' and choose as an error estimate the range Ae within 
which P(e) G [0.1,0.9]. 

Fig. 3 shows e, e' , and Ae obtained from source and control fields for finite z max , 
Fig. 4 shows the results for z max = oo. 

First of all, the figures clearly show evidence for a difference between the results 
obtained from source fields [left columns in Figs. (3,4)] and those obtained from control 
fields [right columns in Figs. (3,4)]. In particular, Fig. 3 shows that 

(1) visually bright (m max <J 19), close (0.5 < z < 1) 1-Jansky sources are correlated 
with soft- and medium-energy X-ray photons, 

(2) 1-Jansky sources in the intermediate redshift interval (1.0 < z < 1.5) do not show 
significant correlation with X-ray photons irrespective of their visual brightness and 
of the photon energy, 

(3) distant 1-Jansky sources (1.5 < z < 2.0) are strongly correlated with soft and hard 
X-ray photons, and 

(4) the results from the control fields show no indication for any significant correlation 
irrespective of the X-ray photon energy. 

(5) The correlations show a trend to increase with increasing optical-brightness threshold 
of the QSO subsamples. 
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Fig. 2 Examples of source and control fields. Panels (a) to (c): source fields with the highest r cor r, 
panels (d) to (f): source fields with the lowest r cor r; panels (g) to (1): the same for control fields. In 
most of the panels, the exposure time is sufficiently short so that individual photons can be seen. The 
white patches are an artefact from data reduction and smoothing with the EXSAS software package. 
The extended sources in panel (b) show two EMSS foreground clusters listed in Tab. 6 



Uitruse A-ray emission around high-redshitt, radio-loud ybUs 



source fields 



control fields 




Fig. 3. Rank-order correlation results for 
source fields (left column) and control fields 
(right column). Field subsamples become 
brighter to the right (decreasing m max ), 
correlations and anticorrelations become 
more significant to the top and the bottom 
of the plots (decreasing and increasing error 
level e), respectively. The thick lines show 
e (the error level from the data), the thin 
lines e' (the median error level from boot- 
strapped data samples), and the error bars 
show the range Ae within which 80% of the 
bootstrapped data samples fall. The three 
rows show the results for different redshift 
intervals. Line types distinguish between 
the energy ranges E 1 > {0.75, 1.0, 1.5} keV 
(solid, dotted, and dashed lines, respec- 
tively) 
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Fig. 4. Similar to Fig. 3, but for z max = oo; 
the panels in the three rows show the results 
for different lower redshift limits z m j n 



For more detailed information, we have also listed the results in Tables 1,2, and the 
bootstrapping results in Tab. 3. 



The bootstrapping results confirm the results obtained from the original subsamples, 
indicating that the results are not dominated by a few sources only, and the errors Ae 
are usually small compared to e or e'. 

It is surprising that no significant correlation of X-ray emission with intermediate- 
redshift 1-Jansky sources (1.0 < z < 1.5) is found, while closer as well as more distant 
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Table 1. Correlation results obtained from source and control fields (data displayed in Fig. 3). z m i n ,max 
are the limiting redshifts of the source subsamples, m max is the maximum optical magnitude of the 
sources, and N is their number in the subsample. e>£ is the error level of the correlation in percent, 
where E is the minimum photon energy in keV 
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sources show highly significant correlations with X-ray emission in at least one energy 
band. 

The comparison between results obtained from source and control fields becomes 
more pronounced in Fig. 4. The striking difference between them is immediately apparent, 
and, unexpected at first sight, the control fields exhibit significant anti correlation with 
X-ray emission. Before we turn to a likely explanation of these anticorrelations, we 
employ the Kolmogorov-Smirnov test to see whether the distributions of individual error 
levels from source fields, from control fields, and the theoretically expected distribution 
of error levels, can significantly be discerned. 



5 Kolmogorov-Smirnov tests 

Given two (normalized, cumulative) distributions Pi^{x) of the same, single independent 
variable x, the Kolmogorov-Smirnov (hereafter KS) test yields the statistical error level 
cks for these distributions to be different (cf. Press et al. 1992). We have three distri- 
butions to compare: the distribution of individual correlation coefficients of the source 
fields, the corresponding distribution for the control fields, and the theoretically expected 
distribution of correlation coefficients [e.g., Eq.(5) of BS3] for randomly distributed count 
ranks. The KS test uses the maximum of the absolute value of the difference between 
the two distributions to be compared. This means that it is efficient in detecting a hori- 
zontal shift between the two distributions, especially if they are steep, but less efficient 
in detecting a deformation of one distribution with respect to the other which leaves 
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Table 2. Correlation results obtained from source and control fields (data displayed in Fig. 4). The 
symbols are defined in the caption of Tab.l 
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the median of the distributions approximately unchanged. We therefore expect that the 
KS test will be less sensitive to detect significant differences between source and control 
samples, cks is the statistical significance for two distributions to be equal; conversely, 
it can be considered the error level of the hypothesis that the two distributions differ. 
Thus, a small value of cks indicates a significant difference between the two distributions. 

We have three different distributions to compare: the distribution of correlation 
coefficients of individual source- and control fields, P s (r COTT ) and P c (^corr), respectively, 
and the Student-t distribution Pt(r corr ) theoretically expected for randomly distributed 
photons in the fields. These three distributions are plotted in panel (a) of Fig. 5 for 
E 1 > 0.75 keV, m max = 19, and z ni [ n = 0.5. 

In addition, Fig. 5 displays cks as a function of the limiting redshift z ni [ 11} for pho- 
ton energies > 0/75 keV and m max 6 {18,19,20} (solid, dotted, and dashed lines, 
respectively). Panel (b) shows cks for the comparison of P s with P c , panels (c) and (d) 
for the comparison between P S;C and P t . There are redshift ranges exhibiting significant 
differences between the two distributions, e.g., for 0.5 <^ z m \ n <J 0.8 and, in particular, 
for 1.5 <J z m ; n <J 2.0. This reflects the aforementioned observation that there is an in- 
termediate redshift interval where 1-Jansky sources show no significant correlation with 
X-ray emission, and that the most significant correlations occur for high-redshift, opti- 
cally bright 1-Jansky sources. For m max = 18 (solid curves), there are probably too few 
data points for the KS test to work reliably. 

The anticorrelations found in part of the control-field subsamples also show up in 
some curves in panel (d) of Fig. 5, where the deviation between the distributions comes 
from the large fraction of small correlation coefficients, in contrast to panel (c) of Fig. 5, 
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Table 3. Correlation results obtained from bootstrapping source and control fields; for the definition of 
the symbols, see the caption of Tab.l 
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Fig. 5. Panel (a): the distributions Pt,s,c 
(solid, dotted, and dashed lines, respec- 
tively) for £/ 7 > 0.75 keV, m max = 19, 
and ,z m j n = 0.5; panels (b,c,d): results 
e Ks(^min) °f a KS comparison between pairs 
of these distributions. The abscissae are 
limiting source redshifts z m - ln , i.e., from left 
to right the source subsamples become more 
distant. The ordinate is the error level for 
the distributions to be different, i.e., the sig- 
nificance for them to be different increases 
downward. The three curves per panel are 
for different optical threshold magnitudes of 
the source subsamples, namely for T7i m ax £ 
{18, 19, 20} (solid, dotted, and dashed lines, 
respectively) 



where the reason for the significant deviations is the occurrence of very large correlation 
coefficients in the source sample. Note that the results for m max = 18 are again the least 
significant because of the small numbers of sources. 
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Fig. 6. Simulated cumulative distributions 
of rank-order correlation coefficients. In all 
four panels, the dotted curve displays the 
theoretically expected distribution for ran- 
domly distributed ranks. The simulations 
for panel (a) were purely random fields, 
while in the simulations for the other pan- 
els, we have added sources with different 
widths <7 S to one half of the fields (/ = 0.5) 

6 Origin of anticorrelations in control fields 

At first sight, the highly significant anticorrelations found in many of the control- field 
subsamples (see, in particular, panel (d) of Fig. 5 and Tab.l) are a puzzling result. Look- 
ing into a randomly selected direction on the sky, a random portion of the X-ray emission 
should be seen on average, and it is therefore striking to find the emission anticorrelated 
with the randomly chosen control-field centers. 

The explanation, however, is rather straightforward. The X-ray emission is due to 
both diffuse and discrete sources, where the image of such a discrete source can have a 
diameter of even a few arc minutes dependent on its brightness; bright sources appear 
more extended than dim sources 2 . Consider now a field containing randomly distributed 
photon events, and, in addition, the photons from randomly placed discrete sources with 
given extension. For the purposes of rank-order statistics, this field is then covered with 
25 ring-shaped, concentric cells. Because these cells are chosen to have equal area, they 
become thinner outward. When the radius of the discrete source becomes of the order of 
~ 1', its photons will affect ('contaminate') several adjacent cells if the source is close to 
the outer boundary of the cell pattern, but only one or few cells if it is close to the center. 
Therefore, once the source appears large enough to cover several cells at the boundary, 
it produces a coherent 'distortion' of the count ranks of distant cells, but it affects only 
single cells close to the center. In a sample of such fields, the contamination of single cells 
close to the center is likely to be 'averaged away', but not the coherent contamination of 
multiple cells close to the outer boundary. An example of a discrete source close to the 
outer cell-pattern boundary is seen in panel (j) of Fig. 2. 

To see whether this qualitative consideration can indeed account for the anticor- 
relations detected in the control fields, we have performed Monte-Carlo simulations, as 
follows. 

In a sample of 100 circular fields with radius a/100 x ' ~ 28'.2, iVp points 

(representing photons) are distributed following a Poisson distribution with (N p ) = 500. 
Additionally, a fraction / of these fields are contaminated with randomly placed 'compact 



Additionally, the point-spread function depends on the photon energy; this dependence is, however, 
negligible compared to its broadening for off-axis sources. In contrast to pointed observations, the 
spatial photon distribution of compact sources in the All-Sky Survey is dominated by the shape of 
the off-axis PSF. 
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sources' of Gaussian profile with variance taken randomly from the interval [0.5cr s , 1.5cr s ] 
to mimic different source diameters, where a s is taken as a parameter; the number of 
'photons' per 'source' was randomly drawn from the interval [20,500]. This sample is 
then subjected to rank-order statistics alike the source and control fields. Fig. 6 shows 
the cumulative probability distribution for the correlation coefficients r corr obtained from 
100 runs with / = 0.5 and compares them with the theoretical distribution expected for 
randomly distributed ranks. 

The figure shows that, for uncontaminated fields, the simulated distribution closely 
follows the theoretical curve (upper left panel), but the deviations between the curves 
increase with increasing average width a s of the contaminating sources. Table 4 gives, 
for 10 different simulations (each containing 100 'source fields') and four values of cr s , the 
rank-order correlation error levels e together with the Kolmogorov-Smirnov error levels 
cks from comparing with P t . a s = abbreviates simulations without contamination by 
discrete sources. 

Table 4. Results from ten independent Monte-Carlo simulations of iVgeid = 100 random fields, to half 
of which (/ = 0.5) discrete sources were added. cr s is the average source diameter in arc minutes, e 
is the rank-order correlation error level, and cks is the KS error level, both in %. cr s = means no 
contamination by sources 











cr s (arcmin) 















1 


2 


3 




simulation # 


e 




e 




e 




e 




1 


22.47 


48.11 


69.45 


2.03 


99.90 


0.00 


71.99 


0.03 


2 


9.07 


25.24 


99.78 


0.03 


99.67 


0.00 


96.89 


0.00 


3 


78.09 


42.80 


94.24 


3.03 


62.75 


0.33 


93.27 


0.00 


4 


46.80 


89.40 


70.86 


1.23 


5.02 


0.00 


99.99 


0.00 


5 


4.94 


37.48 


99.92 


0.01 


97.09 


0.00 


97.25 


0.00 


6 


23.48 


84.36 


12.95 


0.12 


66.83 


0.04 


99.99 


0.00 


7 


33.44 


59.00 


35.19 


8.56 


92.64 


0.24 


99.66 


0.00 


8 


47.09 


43.97 


92.85 


0.22 


99.98 


0.00 


100.00 


0.00 


9 


80.54 


49.63 


83.30 


0.29 


86.06 


0.02 


100.00 


0.00 


10 


22.80 


72.18 


73.94 


0.06 


86.15 


0.07 


97.33 


0.00 



The table quantifies the results displayed in Fig. 6 and shows that, with increasing 
average source width, anticorrelations become more frequent and more significant (e — >■ 
100%), and the deviation of the distribution of correlation coefficients from the theoretical 
distribution becomes highly significant (exs — >■ 0%), just as described above. 

Moreover, we show in Fig. 7 the change in the KS error level cks with decreasing 
number A^ e id of simulated fields. The figure confirms that, with increasing source size, 
the correlation-coefficient distribution deviates more and more significantly from the 
theoretically expected distribution, and it also shows that the deviations become less 
significant with decreasing subsample size. The two panels in Fig. 7 were obtained with 
/ = 0.5 (left panel) and / = 0.25 (right panel); i.e., in the right panel, the number of 
contaminated fields is smaller. 

These simulations confirm the suspicion that the detected anticorrelations in the 
control fields originate from the contamination of the fields by the emission from discrete 
sources which appear extended because of the PSPC point-spread function. This also 
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Fig. 7. The figure shows Kolmogorov-Smir- 
nov error levels obtained from simulated 
fields. The fields contained either only ran- 
domly distributed points (solid lines) or 
were additionally contaminated with dis- 
crete sources of width a s £ {1,2,3}' (dot- 
ted, short-dashed, and dashed lines, respec- 
tively) as described in the text. In the left 
panel, half of the simulated fields contained 
such sources, in the right panel, one quar- 
ter. 
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Fig. 8a. For the subsample defined by z > 0.5, m < 19 and radio flux S > 1 Jy, we have plotted in panel 
(I) the mean ranks (TZs(j)) (solid curve) and {'R-c(j)) (dashed curve) of the source and control fields, 
and their difference A(lZ(j)) in panel (III). Panel (II) show the normalized average counts Cg(j') (solid 
curve) and Cc(j) (dashed curve) of the source and control fields, respectively, and their difference AC(j) 
in panel (IV). In each panel, the corresponding error levels from rank-order statistics are indicated. The 
error bars in panels (III) and (IV) are bootstrap error bars obtained as described in Sect. 4, i.e., they 
indicate the range containing 80% of all bootstrap-resampled data sets 



means that the correlations found in the source-field subsamples occur in spite of this 
effect, and therefore strengthens the significance of these correlations. 
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Fig. 8b. Same as Fig. 8a, for a subsample with z > 1, m < 19 and radio flux S > 3 Jy 



7 Further statistical tests 

Following the preceding remark, we want to further investigate the statistical differences 
between the photon counts in the source fields and their control fields. The statistical 
analysis presented in Sect. 4 was made by ranking the cell pattern of each source and each 
control field (for a chosen subsample), then averaging over all sources and control fields 
of this subsample. The resulting average rank schemes (7l(j)) for the source and control 
fields were then again ranked, and a correlation coefficient and the corresponding error 
level were determined. Henceforth, we denote the error levels for the source and control 
fields by crs and crc, respectively. Examples for the mean ranks of several subsamples 
are found in panels (I) of Figs.8a,b,c; the solid and dotted curves correspond to the source 
and control samples, respectively. Panels (III) of these figures show the difference 



A(TZ(j)) = (TZ s (j)) - (TZc(j)) 



(1) 



of the ranks of the source fields and the control fields. We have also ranked this difference 
rank scheme A(lZ(i)) and obtained a correlation coefficient and, correspondingly, an error 
level, which we denote by crx>- 
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1.50^z^2.00, m^19.00, S^1.0, N = 21 
averaged ranks averaged counts 
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Fig. 8c. Same as Fig. 8a, for a subsample with 1.5 < z < 2, m < 19 and radio flux S > 1 Jy 



In addition, we have calculated the mean photon counts of all source and control 
fields of a chosen subsample, weighted by the total photon counts in each source and 
control field in the rings 2 to 25. More precisely, if C(i,j) denotes the photon counts in 
the j-th ring of the i-th source (or control) field of a chosen subsample, we have calculated 



C(i,j) 



(2) 



these normalized average counts are plotted in panels (II) of Figs.8a,b,c for several sub- 
samples, for both the source fields (solid curves) and the control fields (dashed curves). 
We have ranked the C(j) and obtained the corresponding error levels ecs and ecc for 
the source and control fields, respectively. In addition, we have ranked the difference 



AC(j) = C s (j)-C c (j), 



(3) 



plotted in panels (IV) of these figures, and obtained the error level ccd- In Table 5, we 
list the error levels introduced above for several subsamples. 

Several trends are worth mentioning: as expected, the error levels for the differences, 
both for A(lZ(j)) and for AC(j) are smaller than crs and ecs, since the anticorrelation 
of the control fields, which we have attributed to the existence of (supposedly unrelated) 
discrete sources in the fields, must also be present in the source fields. By taking the 
difference between source and control fields, this effect should be largely eliminated from 



hurther statistical tests 



lb 



0.50^z^°o, m^19.00, S^1.0, N=192 
averaged ranks averaged counts 



7 



6 



1 1 i i i i i i i i i i i i i i i i i i i i i i 



To 




e RS =19.2%, e RC =92.5% 



1 I I I I I I I I I I I I I I I I I I L * 



i i i i i i i i i i i i i i i i i i i i i i i 



(ii) 



e cs =4.9%, e cc =93.1% 




I i i i i I i i i i I i i i i I i i i i I i i- 



1.1 



0.9 



10 20 30 40 50 10 20 30 40 50 
distance ['] distance ['] 



Fig. 8d. Similar to Fig. 8a, but for twice the angular scale (5'6 . . . 57^6). The curves, in particular those 
in the lower panels, are approximately flat for distances > 20', and rise for distances < 15'. The distance 
where the lower curves drop to zero, ~ 15', roughly corresponds to the Abell radius of galaxy clusters 
at intermediate redshifts 



the source fields, and the resulting schemes should be cleaner representations of the 
truly correlated counts. Second, the error levels obtained from the mean ranks and the 
normalized average counts are comparable, i.e., they do not differ significantly. The 
figures give a good illustration of our method, as well as its sensitivity. The correlation 
of the X-ray photons with the radio QSOs can be seen 'by eye' in these plots, best of 
course in the plots for the difference of the ranks and the difference of the normalized 
average counts. 

The correlation results should become less significant when the correlation scale is 
enlarged. To test this, we have counted X-ray photons in source and control fields in 13 
cells of equal width (we have chosen 4') covering a ring between 5'.6 and 57'.6 centered on 
the field centers, and weighted the counts by the cell area. To these counts, we applied 
the same type of analysis as described above; the results for one subsample (the same as 
in Fig. 8a) are displayed in Fig.8d. The curves are approximately flat for large distances 
(;> 20') from the center, while they rise for distances smaller than ~ 15'. For further 
discussion, see Sect.9.1. 

In order to exclude the possibility that the observed correlations are due to either the 
wings of the point spread function or to a halo around the QSOs caused by scattering 
of dust in the interstellar medium, we have repeated the correlation calculations by 
subtracting a fraction / of the photon counts in the central cell (j = 1) from the other 24 
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Table 5. Correlation results for the subsamples of Tab. 2, obtained from the methods described in Sect. 7 
applied to the energy regime E-y > 0.75 keV. cr(s,C,D) are error levels found from ranking the source and 
control fields and their differences, respectively, while ec(S,C,D) were derived from normalized photon 
counts 
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rings, according to an angular dependence oc 0~ a . We have tested the cases / = 1% and 
/ = 3%, and a = 1 and a = 2; these should provide overestimates of the possible flux from 
the QSOs spread into the outer rings (see also Fig.l). Whereas the correlation coefficients 
become slightly smaller if we allow for this spreading, in no case does a strong correlation 
become significantly weakened. We therefore conclude that the observed correlation is 
not caused by the wings of the point spread function or a scattering halo around the 
QSOs. 

Another comparison between source and control fields is illustrative: in Fig. 9 we 
have plotted a histogram of the ratio (N pos — N neg )/(N pos + N neg ), where N pos and iV neg 
are the number of fields in which the correlation coefficient in the source field is larger or 
smaller, respectively, than in the corresponding control field, for different redshift bins. 
The trend seen in the preceding tables is also visible here: the correlation coefficient in 
the source fields is larger than that in the corresponding control fields in the majority of 
cases, if the redshift is either smaller than about 1 or larger than about 1.5, whereas no 
clear trend is visible in the intermediate redshift regime. 

In accordance with the results in Sect. 4, the correlation analyses presented here do 
not yield a significant correlation of QSOs in the redshift interval 1 < z < 1.5 with X-ray 
photons. If the correlations that we observe for other redshifts are due to a magnification 
bias caused by lensing, we expect the correlation coefficients to increase if we increase 
the radio flux threshold of the QSO sample, since the radio source counts are steeper for 
the highest fluxes. Thus we repeated the statistical analysis by considering only QSOs 
with radio flux larger than 2 or 3 Jy; the results for the redshift interval 1 < z < 1.5 are 
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Fig. 9. A histogram of the ratio (N pos — 
Nneg) I (N pos + iV neg ) , where 7Vp OS and jV neg 
are the number of fields in which the corre- 
lation coefficient in the source field is larger 
or smaller, respectively, than in the corre- 
sponding control field, for different redshift 
bins. The pair of numbers in each bin de- 
notes (7Vp OS /N pos + N neg ) 

displayed in Tab. 6. As one can see, increasing the radio flux threshold of the sample, 
the correlations, which can not be detected at a significant level for a threshold of 1 Jy, 
become visible for a larger radio flux threshold. This trend is present in all subsamples, 
unless the number of sources in the subsample becomes smaller than about 5. Therefore, 
we conclude that the trend of the correlation coefficient, or the corresponding error level, 
is in agreement with expectations based on a gravitational lens interpretation of our 
results. A discussion will be presented in Sect. 9. 



Table 6. Correlation results for the subsamples of Tab.l, using the methods of Sect. 7. For the meaning 
of the symbols, see the caption of Tab. 5 
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8 Cross-identifications of correlated sources 



8.1 Close X-ray clusters 

There is yet no catalog of X-ray bright clusters covering the whole sky. The Extended 
Medium Sensitivity Survey of the EINSTEIN observatory (hereafter EMSS) covers only 
~ 780 square degrees, i.e., ~ 1.9% of the whole sky. It is therefore not yet possible to 
decide which cases of strong correlation between a background source and X-ray emission 
are due to an X-ray luminous cluster. Nevertheless, there are two 1-Jansky sources having 
a correlation error level of less than 10% in all three photon-energy bands which ly within 
60' of an EMSS cluster. One of them, Jy2216 - 038, was already found in BS3 to be 
strongly correlated with Lick galaxies. Table 6 contains the relevant data of the two 
sources and their foreground clusters. 

Table 7. Strongly correlated 1-Jansky sources (e < 10% in all energy bands) with an EMSS cluster in the 
foreground (^duster < ^source) which is less than 1° away from the source. AO is the angular separation 
between source and cluster center in arc minutes, Lx is the (Einstein-de Sitter) X-ray luminosity of the 
cluster in 10 44 h~ 2 erg/s. The clusters close to Jy2216 — 038 are seen in panel (b) of Fig. 2 
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8.2 Sources correlated with galaxies and X-ray emission 

The correlation analyses presented in BS2,3 have focussed on associations of foreground 
galaxies with background QSOs; the present study investigates associations of diffuse X- 
ray emission with the same sample of background objects. It was shown above that X-ray 
emission by discrete sources tends to introduce anticorrelations rather than correlations, 
which further indicates that we, in cases where correlations are found, deal with emission 
from extended sources. Galaxy clusters are the most likely objects in this respect. If the 
X-ray emission from galaxy clusters causes at least part of the correlations found here, 
there should be 1-Jansky sources which are strongly correlated with galaxies and X-ray 
emission. This is indeed the summarized in Tab. 8. 

The table lists 1-Jansky sources which are strongly correlated (here, r corr > 0.2) 
with both Lick galaxies and X-ray emission (left part of the table) and with both IRAS 
galaxies and X-ray emission (right part). The table also distinguishes between the three 
photon-energy bands. 

First of all, the number of multiple correlations decreases with increasing photon 
energy. This is at least partially due to the decreasing photon number density with in- 
creasing photon energy, causing larger relative random fluctuations in the photon counts 
per cell, and therefore also in the count ranks, and thus yielding less significant corre- 
lation results. The 1-Jansky sources in the table are probably not physically associated 



bummary and discussion 



Table 8. 1-Jansky sources which are highly correlated with X-ray emission and either Lick or IRAS 
galaxies. mQso is the visual magnitude of the source, zqso its redshift, and r^ r r ^ y ' Llck ' IRAS are the 
correlation coefficients with the respective foreground source types 
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with the clusters potentially responsible for the correlations with both X-ray emission 
and galaxy counts, because most of them are at redshifts where clusters are unlikely to 
be detected in the All-Sky Survey and in the galaxy surveys. 



9 Summary and discussion 



9.1 Data and results 

We have investigated whether diffuse X-ray emission in the energy range E 1 E [0.75, 2.2] 
keV is correlated with 1-Jansky sources above redshift 0.5 on scales of :> 10'. For that 
purpose, we have applied rank-order statistics as described in BS1,2,3 to fields taken from 
the ROSAT All-Sky Survey. Of these fields, 246 were centered on the positions of the 
1-Jansky sources, and 246 were chosen such that they had equal galactic latitude as the 
source fields, but random galactic longitude. Subsamples of source fields were selected 
by the redshifts and the optical (visual) magnitudes of the sources, to see whether the 
results depend on these parameters. Each control field was formally assigned the redshift 
and the optical magnitude of the source field with the same galactic latitude, so that the 
choices of redshift- and magnitude intervals equally changed the number of fields in the 
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control sample. The 'strength' of correlations is quantified via the rank-order correlation 
coefficient r corr , and, equivalently, the correlation error level e. 

The results presented in Figs. 3, 4 of Sect. 4 and in Tabs. 1,2 show that 

(1) there is evidence at a significance level of up to 99.8% for correlations between distant 
(z ;> 1.5), optically bright (m <J 19) 1-Jansky sources and diffuse X-ray emission 
quite irrespective of the X-ray photon energy E^; 

(2) there is evidence on a slightly less significance level (< 97.3%) for correlations be- 
tween 'close' (z <j 1.0) 1-Jansky sources and diffuse X-ray emission with ;> 
0.75 keV; 

(3) in the intervening source redshift interval 1.0 <J z <J 1.5, there is no sign of sig- 
nificant correlations, irrespective of the source-subsample parameters and the X-ray 
photon energy; however, when the radio flux threshold is increased, highly significant 
correlations are obtained (cf. Tab. 6); 

(4) the results obtained from source fields and those obtained from control fields are 
clearly distinct, to be seen most clearly in Fig. 4; 

(5) the control fields tend to show anti correlations with X-ray emission, which reach a 
significance of up to 100%. 

Although on a weaker significance level, the Kolmogorov-Smirnov tests performed 
in Sect. 5 support the results listed above. Also, the results from bootstrapping source- 
and control fields indicate that the correlation- or anticorrelation effects are not due to a 
few sources, because a bootstrapped sample misses on average 1/e ~ 37% of the original 
members. 

Item (5) of the above list, disturbing at first sight, finds a simple explanation in 
Sect. 6, where we show with the help of Monte-Carlo simulations that these anticorrela- 
tions are due to discrete sources whose emission is superposed on the otherwise diffuse 
X-ray background. This explanation also means that the correlations found in the source- 
field subsamples actually are more significant than given in the left columns of Figs. 3, 4 
and Tab.l, because the correlations have to 'come up against' the anticorrelations which 
are naturally present because of the discrete-source contamination of the fields. It is, 
however, hardly possible to quantify this effect because it depends on the extension of 
the contaminating sources, which in turn depends on their brightness and their hardness. 

The reason for item (3) of the above list, i.e., the existence of a source-redshift 
interval where no significant correlations are found, is unclear to us. Again, the boot- 
strapping results indicate that the lack of significant correlations in this redshift band 
is not caused by a few sources only. Possibly, the correlations found for 'low' source 
redshifts (z <J 1.0) are caused by different sources of diffuse X-ray emission than in the 
case of distant sources (z J> 1.5). Some support for such a hypothesis may be that the 
distant sources are correlated with soft and hard X-ray photons, while the close sources 
are correlated with soft X-ray emission and show no significant correlation with hard 
X-ray emission. If, however, the correlations for close sources were due to possible host 
clusters of the sources themselves, these clusters would be at redshifts 0.5 <J z <J 1.0, 
and the maximum of their X-ray luminosity should be around ~ 2 ... 3 keV, i.e., in they 
should be brightest in the 'hard' photon band of our study (E 1 e [1.5,2.2] keV), while 
we see the correlations in this redshift range only for softer photons. In the 'hard' band, 
however, the photon number density is lowest and therefore most affected by Poisson 
noise. 

The curves in panels (III) and (IV) of Fig.8d drop to zero at ~ 15', an angular scale 
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roughly corresponding to the Abell radius of galaxy clusters at intermediate redshifts. 
Closer inspection reveals that this increase towards the field center is primarily due to 
sources in the redshift regime [0.5. . .1.0], while it is less pronounced and off-center for 
higher-redshift sources. This could mean that the enhanced X-ray emission around 1- 
Jansky sources is due to the emission of possible QSO host clusters. If, as we will argue 
below, the X-ray emission around high-redshift 1-Jansky sources is due to gravitationally 
lensing galaxy clusters in their foreground, these clusters would not be expected to be 
centered on the background sources, and they would also appear less extended because 
of their larger average distance and their weaker average flux. 

9.2 Interpretations and prejudices 

It can definitely be said on the basis of the above that the positions of 1-Jansky sources 
do not resemble randomly chosen positions on the X-ray sky as seen by ROSAT in the All- 
Sky Survey: the control fields, which were randomly selected, show a distinctly different 
behaviour than the source fields. We have taken care to cut out the X-ray emission of the 
1-Jansky sources themselves by omitting the central circle of 100 square arc minutes of 
each field, which should definitely be safe in view of the PSPC point-spread function. The 
correlation scale is on the order of 10' . . . 20', corresponding to ^ 5/h Mpc at intermediate 
redshifts, h being the Hubble constant in units of 100 km/s/Mpc. This is a scale typical 
for galaxy clusters, and at the same time clusters are the largest contingent X-ray emitters 
known in the Universe. 

If the sky positions of 1-Jansky sources are distinguished from random positions by 
being close to galaxy clusters, two possibilities arise: either the 1-Jansky sources are 
physically associated with the clusters, or they are gravitationally lensed by them. Let 
us assume for the moment that the detected correlations are due to the possible host 
clusters of the 1-Jansky sources. Then, the clusters have to have the same redshifts as the 
1-Jansky sources. From the excess counts in the central region of the source fields (see 
Fig.8d, panel (II)) and (n 7 ) ~ 0.2 arcmin -2 , we estimate an excess count rate from the 
hypothetical clusters of 10~ 2 s _1 . Assuming a thermal spectrum of 10 7 K and a mean 
galactic hydrogen column density of 3 x 10 20 cm -2 , we estimate an average luminosity of 
8 x 10 43 /i _2 erg/s for such clusters at a redshift of 0.5. If, e.g., ~ 10% of all 1-Jy sources 
have a possible host cluster, they do not need to be extremely luminous to be statistically 
detected in the All-Sky Survey, although they would be much too faint to be individually 
detected. This means that for 0.5 <; z <J 1.0 we might indeed see a possible host-cluster 
emission, with the tendency that optically brighter 1-Jy sources have a more luminous 
host. For higher redshifts, however, the clusters would have to exceed a luminosity of 
10 45 h~ 2 erg/s to be detected. We therefore conclude that possible host clusters may 
indeed cause the correlations seen for the lowest source redshifts, but that this is very 
improbable for the highest redshifts, since then the clusters would have to be extremely 
luminous, L x ^ 10 45 -- 46 /t~ 2 erg/s. 

On the other hand, it is striking to see in Tab. 8 that there is a number of highly 
correlated 1-Jansky sources which are correlated not only with X-ray emission, but also 
with either Lick or IRAS galaxies. If the X-ray emission is due to clusters, the correlated 
galaxies should belong to these clusters. The galaxies in the Lick or IRAS catalogs, 
however, are certainly in the foreground of the 1-Jansky sources, because the catalogs 
are not deeper than z <J 0.2 for Lick and z <; 0.4 for IRAS (for the Lick catalog, see 
Shane & Wirtanen 1967, Seldner et al. 1977, for the IRAS Faint Source Catalog, cf. 
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Strauss et al. 1990, Yahil et al. 1991, and for the redshift distribution of IRAS galaxies, 
see Saunders et al. 1990). This indicates that at least for those sources which appear 
in Tab. 4, if the X-ray emission comes from clusters, they should be in the foreground, 
which then can hardly avoid lensing the background sources, thereby magnifying them, 
and causing a magnification bias. The sources listed in Tab. 8 are also natural targets 
to look for coherent distortions of the images of background galaxies (cf. Bonnet et al. 
1993). 

Further support for the lensing hypothesis is provided by the result (apparent in the 
left columns of Figs. 3, 4 and Tabs. 1,2) that the correlations (1) are stronger for high than 
for low source redshifts and (2) that the correlations tend to strengthen when the source 
subsamples become optically brighter. If the radio emission of the 1-Jansky sources is at 
most weakly correlated with their optical emission, the multiple- waveband magnification 
bias effectively steepens the joint luminosity function of the sources and thereby enhances 
the magnification bias from lensing. The strengthening of the correlations with increasing 
optical source brightness therefore tentatively argues for the lensing effect. Note also 
in Tab. 2 that strong correlations for optically bright, distant 1-Jansky sources appear 
despite the fact that the source subsamples are very small, indicating that most of the 
sources in such subsamples are highly correlated with X-ray emission and not just a few. 
For increasing source redshift, we expect the lensing frequency to rise, because the optical 
depth for magnification increases with increasing volume between the sources and the 
observer. 

A final, admittedly conjectural argument for the X-ray correlation to come from 
the foreground and not from host clusters goes as follows. Observations of clusters 
show that clusters probably are young (e.g., Geller & Beers 1982, Gioia et al. 1990, 
Henry et al. 1992, Henry 1992), and it is also expected from theoretical arguments that 
the spatial number density of clusters evolves rapidly between z <; 0.7 and today, if 
the density of the Universe is close to its critical (Einstein-de Sitter) value (Richstone 
et al. 1992, Bartelmann et al. 1993). This means clusters should be rare at redshifts 
z ;> 0.5, and therefore also possible host clusters of 1-Jansky sources should be a negligible 
phenomenon for distant sources. Moreover, the evolution of the X-ray luminosity function 
of clusters shows that the spatial number density of X-ray luminous clusters decreases 
with increasing redshift, making it improbable that the correlations of high-redshift 1-Jy 
sources with X-ray emission is due to clusters physically associated with them. 

In any case, Tab. 7 shows that there are at least two sources in our sample, Jy2216 — 
038 and Jyll27— 145, which are 'close' to known X-ray luminous foreground clusters. If 
the clusters can be modelled by singular isothermal spheres with a velocity dispersion of 
~ 1200 km/s, they are expected to magnify the sources by <; 10% because of the large 
distance between the cluster centers from the sources. 
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